A Quantiication of Distance-bias between Evaluation Metrics in Classiication

نویسندگان

  • Ricardo Vilalta
  • Daniel Oblinger
چکیده

This paper provides a characterization of bias for evaluation metrics in classiication (e.g., Information Gain, Gini, 2 , etc.). Our characterization provides a uniform representation for all traditional evaluation metrics. Such representation leads naturally to a measure for the distance between the bias of two evaluation metrics. We give a practical value to our measure by observing if the distance between the bias of two evaluation metrics correlates with diierences in predictive accuracy when we compare two versions of the same learning algorithm that diier in the evaluation metric only. Experiments on real-world domains show how the expectations on accuracy diierences generated by the distance-bias measure correlate with actual diierences when the learning algorithm is simple (e.g., search for the best single-feature or the best single-rule). The correlation, however, weakens with more complex algorithms (e.g., learning decision trees). Our results show how interaction among learning components is a key factor to understand learning performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation Metrics in Classiication: a Quantiication of Distance-bias

This paper provides a characterization of bias for evaluation metrics in classiica-tion (e.g., Information Gain, Gini, 2 , etc.). Our characterization provides a uniform representation for all traditional evaluation metrics. Such representation leads naturally to a measure for the distance between the bias of two evaluation metrics. We give a practical value to our measure by observing if the d...

متن کامل

Evaluation Metrics in Classification: A Quantification of Distance-Bias

This paper provides a characterization of bias for evaluation metrics in classification (e.g., Information Gain, Gini, χ, etc.). Our characterization provides a uniform representation for all traditional evaluation metrics. Such representation leads naturally to a measure for the distance between the bias of two evaluation metrics. We give a practical value to our measure by observing the dista...

متن کامل

Review of ranked-based and unranked-based metrics for determining the effectiveness of search engines

Purpose: Traditionally, there have many metrics for evaluating the search engine, nevertheless various researchers’ proposed new metrics in recent years. Aware of this new metrics is essential to conduct research on evaluation of the search engine field. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating the search engines. Methodology: This is ...

متن کامل

Submitted to CVPR ' 99 Discriminant Analysis based Feature ExtractionW

We propose a new feature extraction scheme called Discriminant Component Analysis. The new scheme decomposes a signal into orthonormal bases such that for each base there is an eigenvalue representing the discriminatory power of projection in that direction. The bases and eigenvalues are obtained based on certain classiication criterion. For simplicity, a criterion used in Fisher's Discriminant...

متن کامل

Handing the Microphone to Women: Changes in Gender Representation in Editorial Contributions Across Medical and Health Journals 2008-2018

The editorial materials in top medical and public health journals are opportunities for experts to offer thoughts that might influence the trajectory of the field. To date, while some studies have examined gender bias in the publication of editorial materials in medical journals, none have studied public health journals. In this perspective, we studied the gender ratio ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000